A DTW-based DAG technique for speech and speaker feature analysis

Author

  • Jingwei Liu
Abstract

A DTW-based directed acyclic graph (DAG) optimization method is proposed to exploit the interaction between speech and speaker information in the feature components. We introduce a DAG representation of intra-class samples based on the dynamic time warping (DTW) measure and propose two criteria based on the in-degree of the DAG. Combined with the (l − r) optimization algorithm, the DTW-based DAG model is applied to examine which feature subsets represent speech and speaker information in text-dependent speaker identification and speaker-dependent speech recognition. The experimental results show that the model reveals low-dimensional performance and the influence of speech and speaker information in the different tasks; the corresponding DTW recognition rates are also calculated for comparison.
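For readers unfamiliar with the DTW measure that the abstract builds on, the following is a minimal sketch of the textbook DTW distance between two sequences of feature vectors. It is not the paper's implementation: the function name `dtw_distance`, the Euclidean local cost, the unconstrained step pattern, and the optional `dims` argument (a hypothetical way to restrict the comparison to a subset of feature components) are all assumptions made for illustration. The DAG construction and the in-degree criteria are not specified in the abstract and are not sketched here.

```python
import math


def dtw_distance(a, b, dims=None):
    """Textbook DTW distance between two sequences of feature vectors.

    a, b : lists of equal-dimension feature vectors (lists of floats).
    dims : optional list of component indices (hypothetical parameter) that
           restricts the comparison to a feature subset.
    """
    if dims is not None:
        # Keep only the selected feature components of every frame.
        a = [[frame[k] for k in dims] for frame in a]
        b = [[frame[k] for k in dims] for frame in b]

    n, m = len(a), len(b)
    # cost[i][j] holds the minimal accumulated cost of aligning a[:i] with b[:j].
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = math.dist(a[i - 1], b[j - 1])  # Euclidean frame distance (assumed)
            cost[i][j] = local + min(
                cost[i - 1][j],      # frame of b is repeated
                cost[i][j - 1],      # frame of a is repeated
                cost[i - 1][j - 1],  # both sequences advance
            )
    return cost[n][m]
```

Calling `dtw_distance(x, y, dims=[0, 3])` on two utterances would compare them using only components 0 and 3, which is one simple way to evaluate a candidate feature subset under a DTW measure.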

Similar resources

Comparing DTW-Based and HMM-Based Text-Dependent Speaker Verification Algorithms

Speaker verification is among the widely used biometrics, which usually offer more secure authentication for user access than regular passwords. In this final project, we study DTW-based and HMM-based speaker verification algorithms and compare them based on their performance on our recorded dataset. The two feature sets commonly used in Speech Recognition Systems, LPC ...

Effect of Dynamic time Warping Based Alignment on the Accuracy of the Transformation Function for Voice Conversion

Abstract: Voice conversion transforms the speaker characteristics of speech uttered by one speaker, called the source speaker, to generate speech with the voice characteristics of a desired speaker, called the target speaker. Voice conversion is used in many applications, namely dubbing, speech quality enhancement, text-to-speech synthesizers, online games, multimedia, music, cross...

Fast Speaker Recognition using Efficient Feature Extraction Technique

Digital processing of the speech signal and speaker recognition algorithms are very important for fast and accurate automatic voice recognition technology. Direct analysis of the voice signal is complex because the signal contains too much information. Therefore, digital signal processing steps such as Feature Extraction and Feature Matching are introduced to represent the voice signal. The non-para...

Linear and non-linear fusion of ALISP-based and GMM systems for text-independent speaker verification

Current state-of-the-art speaker verification algorithms use Gaussian Mixture Models (GMM) to estimate the probability density function of the acoustic feature vectors. They are denoted here as global systems. To give better performance, they have to be combined with other classifiers, using different fusion methods. The performance of the final classifier depends on the choice of the s...

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...

Publication date: 2003